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Abstract. We study the expected topological properties of Cech 
and Vietoris-Rips complexes built on random points in R'^. We find 
higher dimensional analogues of known results for connectivity and 
component counts for random geometric graphs. However, higher 
homology H^. is not monotone when fc > 0. 

In particular for every A; > we exhibit two thresholds, one 
where homology passes from vanishing to nonvanishing, and an- 
other where it passes back to vanishing. We give asymptotic formu- 
las for the expectation of the Betti numbers in the sparser regimes, 
and bounds in the denser regimes. The main technical contribu- 
tion of the article is the application of discrete Morse theory in 
geometric probability. 

1. Introduction 

The random geometric complexes studied here are simphcial com- 
plexes built on an i.i.d. random points in Euclidean space R'^. We 
identify here the basic topological features of these complexes. In par- 
ticular, we identify intervals of vanishing and non-vanishing for each 
homology group Hk, and give asymptotic formulas for the expected 
rank of homology when it is non- vanishing. 

There are several motivations for studying this. The area of topo- 
logical data analysis has been very active lately |29l [12] , and there is 
a need for a probabilistic null hypothesis to compare with topological 
statistics of point cloud data [8j. 

One approach to this problem was taken by Niyogi, Smale, and 
Weinberger ^24j, who studied the model where n points are sampled 
uniformly and independently from a compact manifold M embedded 
in M"^, and estimates were given for how large n must be in order to 
"learn" the topology of M with high probability. Their approach was 
to take balls of radius r centered at the n points and approximate the 
manifold by the Cech complex; provided that r is chosen carefully, once 
there are enough balls to cover the manifold, one has a finite simplicial 
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complex with the homotopy type of the manifold so in particular one 
can compute homology groups and so on. 

The main technical innovation in [23] is a geometric method for 
bounding above the number of random balls needed to cover the man- 
ifold, given some information about the curvature of the manifold's 
embedding. The assumption here is that one already knows how large 
r must be, or that one at least has enough information about the ge- 
ometry of the embedding of M in order to determine r. (In a second 
article, they are able to recapture the topology of the manifold, even in 
the more difficult setting when Gaussian noise is added to every sam- 
pled point [25] . Still, one needs some information about the embedding 
of the manifold.) 

In this article we study both random Vietoris-Rips and Cech com- 
plexes for fairly general distributions on Euclidean space M.'^, and most 
importantly, allowing the radius of balls r to vary from to oo. We 
identify thresholds for non- vanishing and vanishing of homology groups 
Hk and also derive asymptotic formulas and bounds on expectations 
of the Betti numbers (3k in terms of n and r. It is well understood in 
computational topology that persistent homology is more robust than 
homology alone (see for example the stability results of Cohen-Steiner, 
Edelsbrunner, and Harer [lO]), and one might not know anything about 
the underlying space, so in practice one computes persistent homology 
over a wide regime of radius [29] . 

There is also a close connection to geometric probability, and in 
particular the theory of geometric random graphs. Some of our results 
are higher-dimensional analogues of thresholds for connectivity and 
component counts in random geometric graphs due to Penrose [26] , 
and we must also use Penrose's results several times. However, an 
important contrast is that the properties studied here are decidedly 
non-monotone. In particular, for each k there is an interval of radius 
r for which the homology group 7^ 0, and with the expected rank 
of homology E[Pk\ roughly unimodal in the radius r, but we also show 
that for large enough or small enough radius, = 0. 

This paper can also be viewed in the context of several recent articles 
on the topology of random simplicial complexes [HI |23l |2l [181 IIHl I2Z] • 
This article discusses a fairly general framework for random complexes, 
since one has the freedom to choose the underlying density function, 
hence an infinite- dimensional parameter space. 

The probabilistic method has given non-constructive existence proofs, 
as well as many interesting and extremal examples in combinatorics 
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[T], geometric group theory [15j, and discrete geometry [22]. Random 
spaces will likely provide objects of interest to topologists as well. 

The problems discussed here were suggested, and the basic regimes 
described, in Persi Diaconis's MSRI talk in 2006 \TT\. Some of the 
results in this article may have been discovered concurrently and in- 
dependently by other researchers; it seems that Yuliy Barishnikov and 
Shmuel Weinberger have also thought about similar things [3]. How- 
ever, we believe that this article fills a gap in the literature and hope 
that it is useful as a reference. 

1.1. Definitions. We require a few preliminary definitions and con- 
ventions. 

Definition 1.1. For a set of points X C M°', and positive distance r, 
define the geometric graph G{X; r) as the graph with vertices V{G) = X 
and edges E{G) = {{x,y} \ d{x,y) < r}. 

Definition 1.2. Let f : be a probability density function, 

let Xi,X2, ... be a sequence of independent and identically distributed 
d- dimensional random variables with common density f , and let X„ = 
{xi, X2, ■ . ■ , Xn}- The geometric random graph G(X„; r) is the geomet- 
ric graph with vertices Xn, and edges between every pair of vertices u, v 
with d{u, v) < r. 

Throughout the article we make mild assumptions about /, in par- 
ticular we assume that / is a bounded Lebesgue-measurable function, 
and that 

/ f{x)dx = 1 

(i.e. that / actually is a probability density function). 

In the study of geometric random graphs [26] r usually depends on 
n, and one studies the asymptotic behavior of the graphs as n — > oo. 

Definition 1.3. We say that G{Xn]rn) asymptotically almost surely 
(a.a.s.) has property V if 

Pr(G(X„;r) gP) ^1 

as n — 7- oo. 

The main objects of study here are the Cech and Vietoris-Rips com- 
plexes on X„, which are simplicial complexes built on the geomet- 
ric random graph G(X„;r). A historical comment: the Vietoris-Rips 
complex was first introduced by Vietoris in order to extend simplicial 
homology to a homology theory for more general metric spaces [28j . 
Eliyahu Rips applied the same complex to the study of hyperbolic 
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groups, and Gromov popularized the name Rips complex [Tl]. The 
name "Vietoris-Rips complex" is apparently due to Hausmann [T7]. 

Denote the closed ball of radius r centered at a point p by B{p, r) = 
{x I d{x,p) < r}. 

Definition 1.4. The random Cech complex C{Xn',r) is the simplicial 
complex with vertex set X„, and a a face of C{Xn; r) if 

fl E(x„r/2)^0. 

Definition 1.5. The random Vietoris-Rips complex R{Xn',r) is the 
simplicial complex with vertex set X„, and a a face if 

B{xi,r/2)r}B{xj,r/2) ^ 

for every pair Xi,Xj G a. 

Equivalently, the random Vietoris-Rips complex is the clique com- 
plex of G{Xn] r). 

We are interested in the topological properties, in particular the van- 
ishing and non- vanishing, and expected rank of homology groups, of the 
random Cech and Vietoris-Rips complexes, as r varies. Qualitatively 
speaking, the two kinds of complexes behave very similarly. However 
there are important quantitative differences and one of the goals of this 
article is to point these out. 

Throughout this article, we use Bachmann-Landau big-0, little-0, 
and related notations. In particular, for non-negative functions g and 
h, we write the following. 

• g{n) = 0{h{n)) means that there exists Uq and k such that for 
n > uq, we have that g{n) < k ■ h{n). (i.e. g is asymptotically 
bounded above by h, up to a constant factor.) 

• g{n) = Q{h{n)) means that there exists no and k such that for 
n > uq, we have that g{n) > k ■ h{n). (i.e. g is asymptotically 
bounded below by /i, up to a constant factor.) 

• g{n) = Q{h{n)) means that g{n) = 0{h{n)) and g{n) = Q{h{n)). 
(i.e. g is asymptotically bounded above and below by h, up to 
constant factors.) 

• g{n) = o{h{n)) means that for every e > 0, there exists 
such that for n > uq, we have that g{n) < e ■ h{n). (i.e. g is 
dominated by h asymptotically.) 

• g{n) = u{h{n)) means that for every A; > 0, there exists hq such 
that for n > uq, we have that g{n) > k ■ h{n). (i.e. g dominates 
h asymptotically.) 
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When we discuss homology Hk we mean either simphcial homology 
or singular homology, which are isomorphic. Our results hold with 
coefficients taken over any field. 

Finally, we use /i(S') to denote Lebesgue measure for any measurable 
set S" C M*^, and to denote the Euclidean norm of x G M*^. 

2. Summary of results 

It is known from the theory of random geometric graphs ^26j that 
there are four main regimes of parameter (sometimes called regimes), 
with qualitatively different behavior in each. The same is true for the 
higher dimensional random complexes we build on these graphs. The 
following is a brief summary of our results. 

In the subcritical and critical regimes, our results hold fairly gen- 
erally, for any distribution on with a bounded measurable density 
function. 

In the subcritical regime, r = o{n^^^'^), the random geometric graph 
G{Xn] r) (and hence the simplicial complexes we are interested in) con- 
sists of many disconnected pieces. Here we exhibit a threshold for Hk, 
from vanishing to non-vanishing, and provide an asymptotic formula 
for the kth Betti number E[l3k], for k > 1. 

In the critical regime, r = Q{n~^^^), the components of the random 
geometric graph start to connect up and the giant component emerges. 
In other words, this is the regime wherein percolation occurs, and it is 
sometimes called the thermodynamic limit. Here we show that E[l3k] = 
Q{n) and Var[/3A:] = 0(n) for every k. 

The results in the subcritical and critical regimes hold fairly gen- 
erally, for any distribution on M.'^ with a bounded measurable density 
function. In the supercritical and connected regimes, our results are for 
uniform distributions on smoothly bounded convex bodies in dimension 
d. 

In the supercritical regime, r = c<;(n~^/'^). We put an upper bound 
on E[Pk] to show that it grows sub-linearly, so the linear growth of the 
Betti numbers in the critical regime is maximal. Here our results are 
for the Vietoris-Rips complex, and the method is a Morse-theoretic ar- 
gument. The combination of geometric probability and discrete Morse 
theory used for these bounds is the main technical contribution of the 
article. 

The connected regime, r = il((\ogn/nY^'^), is where (j(X„;r) is 
known to become connected [26]. In this case we show that the Cech 
complex is contractible and the Vietoris-Rips complex is approximately 
contractible, in the sense that it is /c-connected for any fixed k. (This 
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means that the homotopy groups tTj vanish for i < k, which imphes in 
turn that the homology groups Hi vanish for i < k as welL) 

Despite non-monotonicity, we are able to exhibit thresholds for van- 
ishing of iffc. For every k > 1, there is an interval in which Hj. 7^ 
and outside of which Hk = 0, so every higher homology group passes 
through two thresholds. 

The rest of the article is organized as follows. In Section [3] we con- 
sider the subcritical regime of radius, in Section H] the critical regime, 
in Section \5\ the supercritical regime, and in Section E] the connected 
regime. In Sections and Owe assume that the underlying distribution 
is uniform on a smoothly bounded convex body mostly as a matter of 
convenience, but similar methods should apply in a more general set- 
ting. In Section [7] we discuss open problems and future directions. 



3. Subcritical 

In this regime, we exhibit a vanishing to non- vanishing threshold for 
homology Hj^, and in the non- vanishing regime compute the asymp- 
totic expectation of the Betti numbers (3^, for k > 1. (The case k = 0, 
the number of path components, is examined in careful detail by Pen- 
rose [26], Ch. 13.) As a corollary, we also obtain information about 
the threshold where homology passes from vanishing to non-vanishing 
homology. We emphasize that the results in this section do not depend 
in any essential way on the distribution on R'^, although we make the 
mild assumption that the underlying density function / is bounded and 
measurable. 

3.1. Expectation. 

Theorem 3.1. [Expectation of Betti numbers, Vietoris-Rips complex] 
For d > 2, k > 1, e > 0, and r„ = 0{n~^^'^~'^) , the expectation of the 
kth Betti number E[Pk] of the random Vietoris-Rips complex R{Xn', r) 
satisfies 

^2fc+2^d(2fc+l) 

as n ^ 00 where Ck is a constant that depends only on k and the 
underlying density function f . 

(We note that this result holds for all k, even when k > d.) 
Using similar methods, we also prove the following about the random 
Cech complex. 
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Theorem 3.2. [Expectation of Betti numbers, Cech complex] For d > 
2, l<k<d — l,e>0, and r = 0{n^^^'^~'^) , the expectation of the kth 
Betti number E[Pk\ of the random Cech complex C{Xn;r) satisfies 

as n ^ oo where Dk is a constant that depends only on k and the 
underlying density function f . 

One feature that distinguishes the Cech complex from the Vietoris- 
Rips complex is that a Cech complex is always homotopy equivalent to 
whatever it covers (this follows form the nerve theorem, i.e. Theorem 
10.7 in So in particular Hk = when k > d. 

In both cases we will see that almost all of the homology is con- 
tributed from a single source: whatever is the smallest possible vertex 
support for nontrivial homology. For the Vietoris-Rips complex this 
will be the boundary of the cross-polytope, and for the Cech complex 
the empty simplex. 

Definition 3.3. The [k + 1)- dimensional cross-polytope is defined to 
be the convex hull of the 2k + 2 points {icj}, where Ci, 62, ... , e^+i are 
the standard basis vectors of M.''^^ . The boundary of this polytope is a 
k-dimensional simplicial complex, denoted Ok- 

Simplicial complexes which arise as clique complexes of graphs are 
sometimes called flag complexes. A useful fact in combinatorial topol- 
ogy is the following; for a proof see [12] . 

Lemma 3.4. If A is a flag complex, then any nontrivial element of 
k-dimensional homology Hk{A) is supported on a subcomplex S with at 
least 2k -\- 2 vertices. Moreover, if S has exactly 2k + 2 vertices, then 
S is isomorphic to Ok. 

We also use results for expected subgraph counts in geometric ran- 
dom graphs. 

Recall that a subgraph if < G is said to be an induced subgraph if 
for every pair of vertices x,y & V{H), we have {x,y} is an edge of H 
if and only if {x, y} is an edge of G. 

Definition 3.5. A connected graph is feasible if it is geometrically 
realizable as an induced subgraph. 

For example the complete bipartite graph Ki j is not feasible, since 
it is not geometrically realizable as an induced subgraph of a geometric 
graph in M^, since there must be at least one edge between the seven 
degree-one vertices. 
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Denote the number of induced subgraphs of G(X„; r) isomorphic 
to H by Gn{H), and the number of components isomorphic to H by 
Jn{H). Recall that / is the underlying density function. For a feasible 
subgraph H of order k, and y e (R'')'^ define the indicator function 
huiy) on sets y oi k elements in R'' by = 1 if the geometric 

graph 1) is isomorphic to H, and otherwise. Let 

fiH = k\~^ f{x)''dx hH{{0,Xi,...,Xk-i})d{xi,...Xk-i). 

Penrose proved the following [26] . 

Theorem 3.6 (Expectation of subgraph counts, Penrose). Suppose 
that\imn^oo{f^) = 0, and H is a connected feasible graph of order k > 2. 
Then 

lim r~'^'^^-^'>n-''E{Gn{H)) = lim r'^'^^-^^n-^ E{Jn{H)) = fin- 

n— >oo n—^oo 

Together with our topological and combinatorial tools, Theorem 13.61 
will be sufficient to prove Theorem 13.11 To prove Theorem 13.21 we 
also require a hypergraph analogue of Theorem 13. 6[ established by the 
author and Meckes in Section 3 of [20], which we state when it is needed. 



Proof of Theorem \3.1[ The intuition is that in the sparse regime, al- 
most all of the homology is contributed by vertex-minimal spheres. 

Definition 3.7. For a simplicial complex A, let Ok{A) (or if context 
is clear) denote the number of induced subgraphs of A combinatorial ly 
isomorphic to the 1-skeleton of the cross-polytope Ok, and let Ofc(A) 
denote the number of components of A combinatorially isomorphic to 
the 1-skeleton of the cross-polytope Ok- 

Definition 3.8. Let /^*(A) denote the number of k- dimensional faces 
on connected components with exactly i vertices. Similarly, let f^\A) 
denote the number of k-dimensional faces on connected components 
containing at least i vertices. 

A dimension bound paired with Lemma 13.41 yields 

(3.1) Ofe < < Ofc + ^+1 

One could work with Z^^^"*"^ directly, but it turns out to be sufficient 
to overestimate f^"^^^^ as follows. For each A;- dimensional face in a 
component with at least 2/c+3 vertices, extend to a connected subgraph 
with exactly 2/c + 3 vertices and (^^2 ^) + k + 2 edges. 
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Figure 1. The case k = 2: the seventeen isomorphism 
types of subgraphs which arise when extending a 3-chque 
to a connected graph on 7 vertices with 7 edges. Each 
subgraph isomorphic to one of these can contribute at 
most 1 to the sum bounding the error term f^"^. 



For example, let k = 2; then 



(3.2) 



02 < /32 < 02 + fp- 



Up to isomorphism, the seventeen graphs that arise when extending a 
2-dimensional face (i.e. a 3-clique) to a minimal connected graph on 7 
vertices are exhibited in Figure [1] 



In particular, /^^ < '^^L-^ Sj, where Si counts the number of sub- 
graphs isomorphic to graph i for some indexing of the seventeen graphs 
in Figure [H 

Moreover, as noted in [26j , the number of occurences of a given sub- 
graph r on f vertices is a positive linear combination of the induced 
subgraph counts for those graphs on v vertices which have F as a sub- 
graph. 

For an example of this, let Gh denote the number of induced sub- 
graphs of G isomorphic to H, and let Gh denote the number of sub- 
graphs (not necessarily induced) of G isomorphic to H. If P3 is the 
path on 3 vertices and is the complete graph on 3 vertices, then 



^17 



G 



P3 



SGks + G 



So for each i we can write Si as a positive linear combination of 
induced subgraph counts, and every type of induced subgraphs has 
exactly 7 vertices. 
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We take expectation of both sides of Equation 13.21 applying linearity 
of expectation, to obtain 

E[d,] < E[f32] < E[o2] + E[fP] 

17 

<E[52] + E[J2 s^] 

i=l 

17 
i=l 

For each i, E[si\ = 0{n'^r^'^), by Theorem 13.61 On the other hand, 
E[o2] = Q{n^r^'^), also by Theorem 13.61 Since we are assuming that 
nr'^ — )■ as — > oo, we have shown that E[f^'''] = o{E[d2]). We con- 
clude that E[f32]/E[d2] ^ 1 as n ^ oo. This gives E[f32] = Q{n^r^'^), 
as desired. 

The proof for > 2 is the same. In general the number of graphs on 
2k + 3 vertices that can arise from the algorithm above is a constant 
that only depends on k, so denote this constant by c^. 

So in general we will have 

E[dk]<E[Pk]<E[ok]+E[§"'+'] 

Cfe 

<E[ok]+E[J2 Si] 

i=l 

Cfc 

<E[02] + Y,E[si]. 

2=1 

For each z = 1, 2, . . . , we have 
and on the other hand 

Since nr'^ — )■ 0, we conclude that E[f3k]/ E[dk] — )■ 1, and 

The case A; = 1 is slightly different. There are several ways of ex- 
tending a 2-clique (i.e. an edge) to a connected graph on 5 vertices and 
4 edges. In this case the graph must be a tree, and there are three 
isomorphism types of trees on five vertices, shown in Figure [2l But in 
this case counting these subgraphs will result in an underestimate for 



RANDOM GEOMETRIC COMPLEXES 



11 



Figure 2. The case k = 1: the three isomorphism types 
of trees on five vertices. Each subgraph isomorphic to one 
of these can contribute at most 4 to the sum bounding 
the error term ff^. 




Figure 3. The regular 2/c-gons prove that the 1- 
skeleton of the cross-poytope Ok is geometrically feasible 
in the plane for every k. If r is slightly shorter than the 
length of the main diagonal, components combinatorially 
isomorphic to this contribute to Pk in the Vietoris-Rips 
complex. 

/;f ^. However, each tree has only four edges, and so one can obtain the 
bound 

/r' <4(ti + t2 + t3), 

where ti,t2;^3 count the number of subgraphs isomorphic to the three 
trees in Figure [2j The argument is then the same as in the case k >2. 

This completes the proof, modulo one small concern: we must make 
sure that the octahedral 1-skeletons are geometrically feasible. It is 
perhaps surprising that this is the case, even when d = 2. But the 
regular 2fc-gons provide examples of geometic realizations of the 1- 
skeleton of for every k, as in Figure El (This fact was previously 
noted by Chambers, de Silva, Erickson, and Christ in ^9j.) 

□ 

Proof of Theorem \3.2[ The argument for the Cech complex proceeds 
along the same lines, mutatis mutandis, but with one important differ- 
ence. Again the dominating contribution to (3k will come from vertex- 
minimal A;-dimensional spheres, but for a Cech complex the smallest 
possible vertex support that a simplicial complex with nontrivial Hk 
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can have is A; + 2 vertices, coming from the boundary of a (A; + 1)- 
dimensional simplex. 

Let Sk denote the number of connected components isomorphic to 
the boundary of a (A; + l)-dimensional simplex. By the same argument 
as before we have 

E[Sk]<E[(3k]<E[Sk]+E[f^''+\ 

Deciding whether some set of A; + 2 vertices span the boundary of 
a (A; + l)-dimensional simplex depends on higher intersections, so in 
particular when k > 2 the faces of the Cech complex are not determined 
by the underlying geometric graph. It is proved in Section 3 of [20J that 
as long as r = o(n^^/'^) then E[Sk] = Qi^n'^'^'^r^''^^^'^). On the other hand 
we have E[f^^^^] = 0(n^+V('=+2)d)^ before, since r = o{n-^^'^) this 
is enough to give that 

lim E[f3k]/E[Sk] = 1, 
and then E[/3k] = 6(n''+^r(''+^)'^) as desired. 

□ 

3.2. Vanishing / non-vanishing threshold. To state the following 
theorems we assume that d > 2 and A; > 1 are fixed and that r is still 
in the sparse regime, i.e. that r = o{n~^^^). 

Theorem 3.9 (Threshold for non-vanishing of Hj^ in the random Vi- 
etoris-Rips complex). 

(1) If 

f 2fc+2 \ 

r = o\n d(2fc+i) j ^ 

then a.a.s. Hk{VR{n;r)) = 0, and 

(2) ^/ 

/ 2fc+2 \ 

r = CO in '*{2fc+i) j ^ 

then a.a.s. Hjs{VR{n;r)) ^ 0. 

Proof. The first statement follows directly from Lemma 13.41 and Theo- 
rem [221 i-e. if r is too small then the connected components are simply 
too small to support nontrivial homology. 

For the second statement, we have from Theorem 13 . II that given this 
hypothesis on r we have that E[l3k] — t- oo. This by itself is not enough 
to establish that f3k ^ a.a.s. However it is established in Section 4 
of [20j that Var[/3fc] is of the same order of magnitude as E[f3k], so this 
follows from Chebyshev's inequality, as in yj. Chapter 4. 

□ 
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The corresponding result for Cech complexes is the following. 

Theorem 3.10 (Threshold for non- vanishing of Hk in the random 
Cech complex). 

(1) If 

( fc+2 

r = o in 



then a.a.s. Hk{VR{n;r)) = 0, and 
(2) ^/ 

fc+2 



then a.a.s. HkiVRin^r)) ^ 0. 

Proof. The proof is identical. The needed result for bounding the vari- 
ance of Var[/3fc] is established in Section 3 of [20]. □ 

4. Critical 

The situation in the critical regime (or thermodynamic limit) is more 
delicate to analyze. We are still able to compute the right order of 
magnitude for E[l3k\. it grows linearly for every k. 

Theorem 4.1. For either the random Vietoris-Rips and Cech com- 
plexes on a probability distribution on M.'^ with bounded measurable den- 
sity function, if r = Q{n~^^^) and k > 1 is fixed, then E[(3k] = Q{n). 

Proof. The proof is the same as in the previous section. For example, 
for the Vietoris-Rips complex we still have 

E[d,] < E[P,] < E[ok] + E[f^''+% 

Penrose's results for component counts extend in to the thermodynamic 
limit, so in particular E\ok] = Q{n) and E[f^'^^'^^] = 0{n). The desired 
result follows. □ 

The thermodynamic limit is of particular interest since this is the 
regime where percolation occurs for the random geometric graph [2B]. 
Bollobas recently exhibited an analogue of percolation on the /c-cliques 
of the Erdos-Renyi random graph [5]. It would be interesting to know 
if analogues of his result occurs in the random geometric setting. 

For example, define a graph with vertices for fc-dimensional faces, 
with edges between a pair whenever they are both contained in the 
same {k -\- l)-dimensional face. Does there exist a constant Ck > 
such that whenever 

lim nr'^ > Ck 
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there is a.a.s. a unique /c-dimensional "giant component" (suitably de- 
fined), and whenever 

hm nr'^ < Ck, 

n—^oo 

all the components s. "small"? 

5. Supercritical 

For this section and the next we assume that the underlying dis- 
tribution is uniform on a smoothly bounded convex body. (Recall 
that a smoothly bounded convex body is a compact, convex set, with 
nonempty interior.) This assumption is not only a matter of conve- 
nience - it would seem that some assumption on density must be made 
to make topological statements in the denser regimes. 

For example, the geometric random graph becomes connected once 
r = fi((log n/nY^'^) for a uniform distribution on a convex body, but for 
a standard multivariate normal distribution r must be much larger, r = 
f2((loglogn/ logn)^/^), before the geometric random graph becomes 
connected [26] . 

The supercritical regime is where r = uj{n~^^'^). In this section we 
give an upper bound on the expectation of the Betti numbers for the 
random Vietoris-Rips complex in this regime. This upper bound is 
sub-linear so this shows that the Betti numbers are growing the fastest 
in the thermodynamic limit. 

The main tool is discrete Morse theory - see the Appendix for the 
basic terminology and the main theorem. A much more complete (and 
very readable) introduction to discrete Morse theory can be found in 

m- 

Theorem 5.1. Let R{Xn',r) be a random Vietoris-Rips complex on n 
points taken i.i.d. uniformly from a smoothly hounded convex body K 
in . Suppose r = uj{n~^/'^), and write W = nr'^ . Then 

= O(W^^e-^^n) 

for some constant c > 0, and in particular E[Pk\ = o{n). 

Here c depends on the convex body K but not on k. In fact it is 
apparent from the proof that c depends only on the volume of K and 
not on its shape. 

Recall that /i(S') denotes the Lebesgue measure of S" C M*^, and ||x|| 
denotes the Euclidean norm of a; e W^. We require a geometric lemma 
in order to prove the main theorem. 
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Lemma 5.2 (Main geometric lemma). There exists a constant > 
such that the following holds. Let I > 1 and {i/q, . . . ,yi} C M.'^ be an 
{I + 1) -tuple of points such that 

WvoW < WviW < ■ ■ ■< Wvih 

and WniW > 1/2. // \\yQ — yi\\ > 1 and \\yi — yj\\ < 1 for every other 
< « < J < /, then the intersection 

I 

I = f]B{y,,l)nB{0,\\y,\\) 

2=1 

satisfies > e^. 

As the notation suggests, depends on d but holds simultaneously 
for all /. 

Proof of Lemma [37E Let y^ = (z/o + yi)/2 denote the midpoint of line 
segment yoyi- By assumption that ||yo ~2/i|| > !> "we have Hl/m — 2/o|| = 
\\ym — 2/1 II > 1/2- We now wish to check that ym is still not too far 
away from any yi with 2 < i < I. 

Let 6 be the positive angle between yo — y2 and yi — y2- Since 
Wuo "2/211 < 1, \\yi — 2/2II < 1, and \\yi — J/2II > 1, the law of cosines 
gives that 

(2/0 - 2/2) • (2/1 - 2/2) = II2/0 - 2/2IIII2/1 - 2/2 II cos 6* 

= ^(ll2/o - 2/2f + II2/1 - 2/2^ - II2/0 - 2/2f ) 
1 

Then 

||2/m - 2/2f = (2/m - 2/2) ■ (2/m - 2/2) 

= ((yo + 2/i)/2 - 2/2) ■ ((2/0 + 2/i)/2 - 2/2) 

= {{yo - 2/2)/2 + (2/1 - 2/2)/2) • ((2/0 - 2/2)/2 + (2/1 - 2/2)/2) 

= (l/4)(||yo - 2/2f + II2/1 - 2/2f + 2(1/0 - 2/2) • (2/1 - 2/2)) 
<(1/4)(1 + 1 + 2(1/2)) 
= 3/4, 



so 

WVm - 2/2II < V^/2. 
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The same argument works as written with y2 replaced by Ui with 3 < 
i <l. Now set p = 1 — By the triangle inequality B{ym,p) C 

B{yi, 1) for 1 < i < I. So we have that 

B{y^,p) n 5(0, llyill) C fl B{y,, 1) n B(0, \\y^\\). 

i=l 

By the triangle inequality we have that \\ym\\ < WviWi and it follows 
that 

p p) n 5(0, llyilD) > /X (i?(yi, p) n 5(0, . 

Since \\yi\\ > 1/2, the quantity /i (i?(?/i, p) fl 5(0, is bounded 

away from zero, and in fact it attains its minimum when = 1/2. 
Set ed equal to this minimum value of p(5(?/i, p) n5(0, and the 

statement of the lemma follows. 

□ 

Scaling everything in R'' by a linear factor of r we rewrite the lemma 
in the form in which we will use it. 

Lemma 5.3. [Scaled geometric lemma] There exists a constant > 
such that the following holds for every r > 0. Let I > 1 and 
{yo, . . . , yk} C M'^ be an (/ + l)-tuple of points, , such that 

WvoW < WviW < ■ ■ ■< \\yi\\ 

and (l/2)r < ||yi||. // \\yo — ViW > r and \\yi — yj\\ < r for every other 
< « < J < then the intersection 

I 

/ = f|5(y„r)n5(0,||yi||) 

4=1 

satisfies p(/) > e^r'^. 

We are ready to prove the main result of the section. 

Proof of Theorem I5.il By translation and rescaling if necessary, as- 
sume without loss of generality that 5(0, 1) C K. Since with prob- 
ability 1 no two points are the same distance to the origin, index the 
points Xn = {xi, . . . ,Xn} by distance to 0, i.e. 

||a^i|| < ||a^2|| < ■ ■ ■ < ||a;n||- 

Now we define a discrete vector field V on R{Xn', r) in the sense of 
discrete Morse theory, as discussed in the Appendix. 

Whenever possible pair face S = {xi-^,Xi^, . . . ,Xij} with face {xig}US 
with io < ii and io as small as possible. This can be done in any 
particular order or simultaneously, and still each face gets paired at 
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most once, as follows. A face S can not get paired with two different 
higher dimensional faces {xa} U S and {xi,} U S, since 5* will prefer the 
vertex with smaller index min{a, b}. On the other hand, it is also not 
possible for S to get paired with both a lower dimensional face and a 
higher dimensional face: Suppose S gets paired with {xa} U S. Then 
ll^all < ||s|| for every s G S", and no codimension 1 face F ^ S could 
also get paired with S, since F would prefer to get paired with {xa}UF. 

Hence each face is in at most one pair and is a well defined discrete 
vector field. Moreover, the indices are decreasing along any V^-path, so 
there are no closed V^-paths. Therefore \^ is a discrete gradient vector 



Let us bound the probability pk that a set of + 1 vertices span a 
/c-dimensional face in the Vietoris-Rips complex. Given the first vertex 
V, the other vertices would all have to fall in B[v,r), so pk = 0{r'^'^). 
Recall that we defined W = nr'^ and we rewrite this bound as 



Given that a set of A; + 1 vertices {xi^ , , • • • , } span a k- 
dimensional face F, how could F be critical (or unpaired) with respect 
to VI It must be that there is no common neighbor Xa of these vertices 
with a < 2i or else F would be paired up by adding the smallest index 
such point. On the other hand F would be paired with {xi^, . . . , Xj^,^^}, 
unless , . . . , Xj^^j had a common neighbor with smaller index than 
Xjj. So assuming that F is unpaired call this common neighbor xi^. 

We have satisfied the hypothesis of Lemma 15.31 with I = k + 1 and 
(If llyill < (l/2)r then either ||yo - ViW < r or \\yo\\ > \\yi\\, 
a contradiction to our assumptions.) So let 



and we know from the lemma that /i(/) > tdr'^ with > constant. 

If any vertices fall in region I then F would be paired; indeed if 
Xa & I then Xa would be a common neighbor of all the vertices in F, 
with a < ii. 

The probability that a uniform random point in K falls in region I 
is fi{I)/fi{K) > edr'^/fi{K), where ^{K) is the volume of the ambient 
convex body. By independence of the random points, we have that the 
probability Pc that F is critical (given that it is a face) is at most 



field. 



Pk = {{W/n)") . 



k+l 



I=f]B{x,„r)nB{0,\\x,,\\), 
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Now 

= 0{exp{-cW)), 



where c is any constant such that 



< c < 



Let Ck denote the number of critical /c-dimensional faces, and we 



have that 



n 



fw\ 



k 



-cW 



Since /3k < Ck in every case we have E[f3k] < E[Ck], and then 

E[Pk] = 0{W''e-'^n), 

as desired. 



□ 



6. Connected 

As in the previous section, we assume that the underlying distribu- 
tion is uniform on a smoothly bounded convex body K, but we now 
require r to be slightly larger, r = ^l{{logn/nY^'^). In this regime, the 
geometric random graph is known to be connected and we show 
here that the Cech complex is contractible, and the Vietoris-Rips com- 
plex "approximately contractible" (in the sense of /c-connected for any 
fixed k). 

Theorem 6.1 (Threshold for contractibility, random Cech complex). 
For a uniform distribution on a smoothly bounded convex body K in , 
there exists a constant c, depending on K, such that ifr > c{logn/nY^'^ 
then the random Cech complex C{Xn]r) is a.a.s. contractible. 

This is best possible up to the constant in front, since there also 
exists a constant c' such that if r < c'(logn/n)^/'^, then the random 
Cech complex is a.a.s. disconnected [26] . 
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Definition 6.2. Let A = {Ai,A2, . . . , Ak} be a cover of a topological 
space T. Then the nerve of the cover A, is the (abstract) simplicial 
complex Af {A) on vertex set [k] = {l,2,...,k} with a C [k] a face 
whenever f]j^^^ Ai ^ 0. 

The proof depends on the following result (Theorem 10.7 in P]). 

Theorem 6.3 (Nerve Theorem). If T is a triangulable topological 
space, and A = (Aj)jg[^.] is a finite cover of T by closed sets, such 
that every nonempty section Ai^ fl Ai^ fl ■ ■ ■ fl Ai^ is contractible, then 
T and the nerve M{T) are homotopy equivalent. 

Proof of Theorem \6.1\ Once r is sufficiently large the balls {-B(xj, r/2)} 
cover the smoothly bounded convex body A', and then Theorem 16.31 
gives that it is contractible. So to prove the claim it suffices to show 
that there exists a constant c > such that whenever r > c(logn/n)^/'^, 
the balls of radius r/2 a.a.s. cover K. There is no harm in assuming 
that r — )■ as n — )■ oo since the statement is trivial otherwise. 

Let 7/" denote the rf-dimensional cubical lattice, and XTJ^ the same 
lattice linearly scaled in every direction by a factor A > 0. With the 
end in mind we set A = r/{A\/d). (Note that since r = r(n), A is also 
a function of n.) Since K is bounded, only a finite number of the 
boxes of side length A intersect it. More precisely, it is easy to see that 

As n — oo and A — )■ almost all of these boxes are contained 
in K, but some are on the boundary. Denote by Sk the set of boxes 
completely contained in K. Suppose every box in Sk contains at least 
one point in Then the balls of radius r/2 cover i^T, as follows. 

First of all, each box has diameter \\fd = r/4. So a ball of radius 
r/2 with a point in one of these boxes not only covers the box itself, 
but all the boxes adjacent to it. Since every boundary box is adjacent 
to at least one box in Sk-, this is sufficient. 

For a box B G Sk, let Po denote the probability that box _BnX„ = 0. 
By uniformity of distribution this is the same for every i?, and by 
independence of the points we have that 

= (1 - A7/i(ir))" 

< exp(-A'^n/^(ir)) 

= exp(-(r/4v^)"'n//i(fs:)) 

= exp(— Cr'^n), 
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where 



C 



1 




There are at most boxes in Sk and 



= (l + o(l))/Cr^ 



so applying a union bound, the probabihty pj that at least one box in 
Sk fails to contain any points from X„ is bounded by 



So choosing Ck > (l/CY^'^ is sufficient to ensure that K is a.a.s. covered 
by the n random balls of radius r/2, and the desired result follows. □ 

The situation for the Vietoris-Rips complex is a bit more subtle 
since the nerve theorem is not available to us. Nevertheless, we use 
Morse theory to show in the connected regime that the Vietoris-Rips 
complex becomes "approximately contractible," in the sense of highly 
connected. 

Definition 6.4. A topological space T is k-connected if every map from 
an i- dimensional sphere ^ T is homotopically trivial for < i < k. 

For example, 0-connected means path-connected, and 1-connected 
means path-connected and simply-connected. The Hurewicz Theorem 
and universal coefficients for homology gives that if T is /c-connected, 
then Hi{T) = for i < k, with coefficients in Z or any field [T6] . 

Theorem 6.5 (fc- connectivity of the random Vietoris-Rips complex). 

For a smoothly bounded convex body K in W^, endowed with a uniform 
distribution, and fixed k > 0, if r > epilog n/nY^'^ then the random 
Vietoris-Rips complex R{Xn',r) is a.a.s. k-connected. (Here Ck > is 
a constant depending only on the volume fJ,{K) and k.) 



Pf < Npo 
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Proof of Theorem \6.5[ The proof is identical to the proof of Theorem 
15.1^ but now r is bigger and we obtain a stronger result. We place a 
discrete gradient vector field on -R(X„; r) in the same way described 
before, and repeat the same argument. If denotes the number of 
critical fc-dimensional faces, c is the constant in the statement of The- 
orem [5IT], and W = nr'^, then we have 



since nr'^ = cflogn. So we have that E[Ck] provided that > 



The same argument holds simultaneously for all smaller values of 
A; > 1 as well, so a.a.s. the only critical cell of dimension < fc is the 
vertex closest to the origin. By Theorem l7.2l in the Appendix, R{Xn', r) 
is a.a.s. homotopy equivalent to a CW-complex with one 0-cell and no 
other cells of dimension < k. This implies that R{Xn] r) is fc-connected 
by cellular approximation [16]. □ 

At the moment we do not know if there is a sufficiently large constant 
t > such that whenever r > t{logn/nY^'^, the random Vietoris-Rips 
complex R{Xn] r) is a.a.s. contractible. In fact it is not even clear 
that making r = u ((logn/ra)^/'^) is sufficient for this; this ensures that 
R{Xn', r) is a.a.s. fc-connected for every fixed k, but our results here 
do not rule out the possibility that there is nontrivial homology in 
dimension k where /c — ?• oo as n — )■ oo. 

6.1. Non-vanishing to vanishing threshold. Given a lemma about 
geometric random graphs which we state without proof, we have a 
second threshold where kth homology passes back from non- vanishing. 

First the statement of the lemma. (We are still assuming that the un- 
derlying distribution is uniform on a smoothly bounded convex body.) 

Lemma 6.6. Suppose H is a feasible subgraph, that r = f2(n~^/'^), and 
that r = o ((logri/n)^/'^) . Then the geometric random graph X{n;r) 
a.a.s. has at least one connected component isomorphic to H. 

This lemma should follow from the techniques in Chapter 3 of [26j. 
Given the lemma, we have the following intervals of vanishing and non- 
vanishing homology for VR{n; r). 



E[Ck] = O {W^e-'^n 



) 
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Theorem 6.7 (Intervals of vanishing and non-vanishing, random Vi- 
etoris-Rips complex). Fix k > 1. For a random Vietoris- Rips complex 
on a uniform distribution on a smoothly bounded convex body in M'^, 

(1) ^/ 

r = a ^n~<*(2fc+i) j or r = u ((log n/n) ^Z'^) 

then a.a.s. = 0, and 

(2) ^/ 

r = tu ^n~^(2FfT) j Q^^d f = ((log n/n) ^^'^) 

then a.a.s. Hj. 7^ 0. 

Similarly for C{n,r) , we have the following. 

Theorem 6.8 (Intervals of vanishing and non- vanishing, random Cech 
complex). Fix k > 1. For a random Cech complex on a uniform dis- 
tribution on a smoothly bounded convex body, 

(1) ^/ 

r = o ^n ''('=+1) j or r = a; ((log n./n)"'^/^) 

then a.a.s. Hk = 0, and 

(2) ^/ 

r = u ^r?,~d(fe+i) j r = ((logn./n)"'^/'^) 

then a.a.s. 7^ 0. 

Proof. In both cases, (1) follows from the results we have established in 
the sparse regime. The point of the Lemma is that as long as r falls in 
this intermediate regime, there IS cL.cL. s. at least one connected compo- 
nent homeomorphic to the sphere S"^, hence contributing to homology 
Hk. □ 

7. Further directions 

From the point of view of applications to topological data analysis, 
the thing that is most needed is results for statistical persistent ho- 
mology [Sj. Bubenik and Kim computed persistent homology for i.i.d. 
uniform random points in the interval [7] applying the theory of order 
statistics, but so far these are some of the only detailed results for per- 
sistent homology of randomly sampled points. (More recently Bubenik, 
Carlsson, Kim, and Luo discussed recovering persistent homology of a 
manifold with respect to some fixed function by data smoothing with 
kernels, and then applying stability for persistent homology [6].) 
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The theorems in this article have imphcations for statistical persis- 
tent homology. In particular, we have bounded the number of nontriv- 
ial homology classes, and since almost all of the homology comes from 
vertex minimal spheres, almost all classes should not persist for long. 
What one might like is to rule out homology classes that persist for 
a long time altogether. Such a theorem would be an important step 
toward quantifying the statistical significance of persistent homology. 

All the results here are stated for Euclidean space, but we believe 
this is mostly a matter of convenience. Analogous results for homology 
should hold for d-dimensional compact Riemannian manifolds. The 
manifold will contribute its own homology in the supercritical regime, 
but for most functions r = r{n) this will be overwhelmed by noise, 
since E[Pk] — ^ oo and the homology of the manifold itself will be finite 
dimensional. In contrast, one would expect persistent homology to 
detect the homology of the manifold itself. 

Although we have bounded Betti numbers here, coefficients have not 
come into play. It seems more refined tools are needed to detect the 
torsion in Z-homology of random complexes. (This comes up for other 
kinds of random simplicial complexes as well; see for example [2].) 

Finally, we comment that the topological properties studied here 
are not monotone, the results suggest strongly that they are roughly 
unimodal. But can this be made more precise? For example, can one 
show that for sufficiently large n, E[l3k] is actually a monotone function 
of r? Similar statistically unimodal behavior in random homology has 
been previously observed in [18j and |19] . 
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Appendix: discrete Morse theory 

In this section we briefly introduce terminology of discrete Morse 
theory and state the main theorem. For a more complete introduction 
to the subject we refer the reader to [T3] . 

For two faces a, r of a simplicial complex, we write o" -< r if cr is a 
face of T of CO dimension 1. 

Definition 7.1. A discrete vector field V of a simplicial complex A is 
a collection of pairs of faces of A {a -< (3} such that each face is in at 
most one pair. 

Given a discrete vector field V, a closed V-path is a sequence of faces 

ao^ /3o>- ai-< (3iy ... ^ (3ny a„+i, 
with aj+i 7^ tti such that {ctj -< f3i} E V for i = 0, . . . ,n and = ao- 
(Note that {(3i >- aj+i} ^ V since each face is in at most one pair.) 
We say that ^ is a discrete gradient vector field if there are no closed 
l^-paths. 

Call any simplex not in any pair in V critical. Then the main theorem 
is the following [13] . 

Theorem 7.2 (Fundamental theorem of discrete Morse theory). Sup- 
pose A is a simplicial complex with a discrete gradient vector field V. 
Then A is homotopy equivalent to a CW complex with one cell of di- 
mension k for each critical k-dimensional simplex. 

Simply counting cells is an extremely coarse measure of the topology 
a complex, but it can be enough to completely determine homotopy 
type; for example a CW complex with one 0-cell and all the rest of its 
cells (i- dimensional is a wedge of ci-spheres. 

In all cases, if fk is the number of cells of dimension k, then the 
definition of cellular homology gives that Pk < fk, and this is the 
main fact that we exploit in Sections \5\ and E] to bound the expected 
dimension of homology. 

References 

[1] Noga Alon and Joel H. Spencer. The probabilistic method. Wiley-Interscience 
Series in Discrete Mathematics and Optimization. John Wiley & Sons Inc., 
Hoboken, NJ, third edition, 2008. With an appendix on the life and work of 
Paul Erdos. 

[2] Eric Babson, Christopher Hoffman, and Matthew Kahle. The fundamental 
group of random 2-complexes. J. Amer. Math. Soc, 24(l):l-28, 2011. 

[3] Yuliy Barishnokov. Quantum foam, August 2009. (Talk given at AIM Work- 
shop on "Topological complexity of random sets"). 



RANDOM GEOMETRIC COMPLEXES 



25 



A. Bjorncr. Topological methods. In Handbook of combinatorics, Vol. 2, pages 
1819 1872. Elsevier, Amsterdam, 1995. 

Bela BoUobas and Oliver Riordan. Clique percolation. Random Structures Al- 
gorithms, 35(3):294-322, 2009. 

P. Bubenik, G. Carlson, P.T. Kim, and Z.M. Luo. Statistical topology via 
Morse theory, persistence and nonparametric estimation. Algebraic Methods in 
Statistics and Probability II, 516:75, 2010. 

Peter Bubenik and Peter T. Kim. A statistical approach to persistent homol- 
ogy. Homology, Homotopy AppL, 9(2):337-362, 2007. 

Gunnar Carlsson. Topology and data. Bull. Amer. Math. Soc. (N.S.), 
46(2):255-308, 2009. 

Erin W. Chambers, Vin de Silva, Jeff Erickson, and Robert Christ. Vietoris- 
Rips complexes of planar point sets. Discrete Comput. Geom., 44(l):75-90, 
2010. 

David Cohen-Steiner, Herbert Edelsbrunner, and John Harer. Stability of per- 
sistence diagrams. Discrete Comput. Geom., 37(1):103-120, 2007. 
Persi Diaconis. Application of topology. Quicktime video, September 2006. 
(Talk given at MSRI Workshop on "Application of topology in science and 
engineering," Quicktime video available on MSRI webpage). 
Herbert Edelsbrunner and John Harer. Persistent homology — a survey. In Sur- 
veys on discrete and computational geometry, volume 453 of Contemp. Math., 
pages 257-282. Amer. Math. Soc, Providence, RI, 2008. 

Robin Forman. A user's guide to discrete Morse theory. Sem. Lothar. Combin., 
48:Art. B48c, 35 pp. (electronic), 2002. 

M. Gromov. Hyperbolic groups. In Essays in group theory, volume 8 of Math. 
Sci. Res. Inst. PubL, pages 75-263. Springer, New York, 1987. 
M. Gromov. Random walk in random groups. Geom. Funct. Anal., 13(1):73- 
146, 2003. 

Allen Hatcher. Algebraic topology. Cambridge University Press, Cambridge, 
2002. 

Jean-Claude Hausmann. On the Vietoris-Rips complexes and a cohomology 

theory for metric spaces. In Prospects in topology (Princeton, NJ, 1994), vol- 
ume 138 of Ann. of Math. Stud., pages 175-188. Princeton Univ. Press, Prince- 
ton, NJ, 1995. 

Matthew Kahle. The neighborhood complex of a random graph. J. Combin. 
Theory Ser. A, 114(2):380-387, 2007. 

Matthew Kahle. Topology of random clique complexes. Discrete Math., 
309(6):1658-1671, 2009. 

Matthew Kahle and Elizabeth Meckes. Limit theorems for Betti numbers of 
random simplicial complexes, submitted, arXiv:1009.4130, 2010. 

Nathan Linial and Roy Meshulam. Homological connectivity of random 2- 
complexes. Combinatorica, 26(4):475-487, 2006. 

Nathan Linial and Isabella Novik. How neighborly can a centrally symmetric 
polytope be? Discrete Comput. Geom., 36(2):273 281, 2006. 
R. Meshulam and N. Wallach. Homological connectivity of random k- 
dimensional complexes. Random Structures Algorithms, 34(3):408-417, 2009. 



26 



MATTHEW KAHLE 



[24] Partha Niyogi, Stephen Smale, and Shmuel Weinberger. Finding the homology 

of submanifolds with high confidence from random samples. Discrete Comput. 

Geom., 39(1-3):419-441, 2008. 
[25] Partha Niyogi, Stephen Smale, and Shmuel Weinberger. A topological view of 

unsupervised learning from noisy data, to appear, 2010. 
[26] Mathew Penrose. Random geometric graphs, volume 5 of Oxford Studies in 

Probability. Oxford University Press, Oxford, 2003. 
[27] Nicholas Pippenger and Kristin Schleich. Topological characteristics of random 

triangulated surfaces. Random, Strnctures Algorithms, 28(3):247-288, 2006. 
[28] L. Vietoris. Uber den hohcrcn Zusamnienhang konipakter Raume und eine 

Klasse von zusammenhangstreuen Abbildungen. Math. Ann., 97(l):454-472, 

1927. 

[29] Afra Zomorodian and Gunnar Carlsson. Computing persistent homology. Dis- 
crete Comput. Geom., 33(2):249-274, 2005. 

Department of Mathematics, Stanford University 
E-mail address: mkahle@math.stanford.edu 



